Optimal affine image normalization approach for optical character recognition
نویسندگان
چکیده
Optical character recognition (OCR) in images captured from arbitrary angles requires preliminary normalization, i.e. a geometric transformation resulting an image as if it was at angle suitable for OCR. In most cases, surface containing characters can be considered flat, and pinhole model adopted camera. Thus, theory, the normalization should projective. Usually, camera optical axis is approximately perpendicular to document surface, so projective replaced with affine one without significant loss of accuracy. An performed significantly faster than which important OCR on mobile devices. this work, we propose fast approach normalization. It utilizes instead there no The based proposed criterion accuracy: root mean square (RMS) coordinate discrepancies over region interest (ROI). problem optimal according considered. We have established that unconstrained optimization quadratic reduced fractional functions integration ROI. latter solved analytically case where ROI consists rectangles. generalized various cases when transform its special are used: scaling, translation, shearing, their superposition, allowing procedure further accelerated.
منابع مشابه
Image Binarization Based On ICA Approach for Optical Character Recognition
Image binarization plays a vital role in text segmentation which is used in OCR application. Binarization of text in degraded images is a challenging task due to the variations in colour, size, and font of the text and the results are often affected by complex backgrounds, different lighting conditions, shadows and reflections. A robust solution to this problem can significantly enhance the acc...
متن کاملImage Normalization and Preprocessing for Gujarati Character Recognition
Pattern recognition has been an important area in computer vision applications. In the case of a planar image, there are four basic forms of geometric distortion caused by the change in camera location: translation, rotation, scaling and skew. So far, a number of methods have been developed to solve these distortions, such as moment invariants’, Fourier descriptor, Hough transformation, shape m...
متن کاملOptical Character Recognition from Text Image
Optical Character Recognition (OCR) is a system that provides a full alphanumeric recognition of printed or handwritten characters by simply scanning the text image. OCR system interprets the printed or handwritten characters image and converts it into corresponding editable text document. The text image is divided into regions by isolating each line, then individual characters with spaces. Aft...
متن کاملGeneralized Affine Invariant Image Normalization
We provide a generalized image normalization technique which basically solved all problems in image normalization. The orientation of any image can be uniquely defined by at most three non-zero generalized complex (GC) moments. The correctness of our method is demonstrated theoretically as well as in practice by applying them to a number of "degenerate" images which have failed other previously...
متن کاملImage Thresholding for Optical Character Recognition and Other Applications Requiring Character Image Extraction
Two new, cost-effective thresholding algorithms for use in extracting binary images of characters from machineor hand-printed documents are described. The creation of a binary representation from an analog image requires such algorithms to determine whether a point is converted into a binary one because it falls within a character stroke or a binary zero because it does not. This thresholding i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computer Optics
سال: 2021
ISSN: ['2412-6179', '0134-2452']
DOI: https://doi.org/10.18287/2412-6179-co-759